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Biological processes involve a variety of spatial and temporal scales. A holistic under¬ 
standing of many biological processes therefore requires multi-scale models which capture 
the relevant properties on all these scales. In this manuscript we review mathematical 
modelling approaches used to describe the individual spatial scales and how they are 
integrated into holistic models. We discuss the relation between spatial and temporal 
scales and the implication of that on multi-scale modelling. Based upon this overview 
over state-of-the-art modelling approaches, we formulate key challenges in mathemati¬ 
cal and computational modelling of biological multi-scale and multi-physics processes. 
In particular, we considered the availability of analysis tools for multi-scale models and 
model-based multi-scale data integration. We provide a compact review of methods for 
model-based data integration and model-based hypothesis testing. Furthermore, novel 
approaches and recent trends are discussed, including computation time reduction us¬ 
ing reduced order and surrogate models, which contribute to the solution of inference 
problems. We conclude the manuscript by providing a few ideas for the development of 
tailored multi-scale inference methods. 

Keywords: Systems Biology, Computational Biology, Multi-scale Modelling, Parameter 
Estimation, Model Selection, Surrogate Models. 
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1 Introduction 

The key aim of mathematical, computational and systems biology is to achieve a holistic understanding 
of biological systems [1]. Over decades this aim was mainly approached by moving from the study 
of single molecules to the analysis of biological networks. Starting with the groundbreaking work of 
Hodgkin & Huxley [2], who described the dynamics of individual neurones using ordinary differential 
equations, a successful revolution began, which resulted in a significantly improved understanding of 
gene regulation, signalling, metabolism and many other processes [3]. Large-scale models for individual 
pathways and collections of pathways have been developed (see e.g. [4, 5, 6, 7, 8, 9, 10]) along with 
powerful theoretical and computational methods. 

Despite great successes it has been recognised that the study of cellular networks is only the first step, 
as multi-cellular organisms are more than a collection of pathways. A mechanistic understanding of 
complex biological functions and processes requires the consideration of multiple spatial and temporal 
scales. During the last two decades, multi-scale models for a variety of processes have been developed 
such as the beating of the heart [11, 12], the development of cancer [13] in all its subtypes, the drug 
delivery and action [14, 15], or the data processing in our brain [16]. These multi-scale and multi¬ 
physics models were derived by expanding existing models using top-down, bottom-up or so called 
middle-out approaches [17], as well as by integrating existing models for different aspects. 

As the development of coherent multi-scale models requires expertise on different biological scales, 
large consortia have been formed, bringing together researches to tackle complex questions. One 
great example is the World Physiome Initiative with the Physiome Project [12, 18], which unites 
researches from all around the globe. Among others, the Physiome Project demonstrated that multi¬ 
scale models can indeed deepen our insights [19], e.g., of the human heart, by integrating models for 
different processes. Depending on the question, a significant benefit in general does not even require 
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the consideration of all possible (intermediate) biological scales [15]. 

Despite the demonstrated applicability of multi-scale models, many important theoretical results and 
computational methods for their analysis are still missing. Multi-scale models are often obtained by 
coupling different model classes, e.g., logical models, ordinary differential equations (ODE), partial 
differential equations (PDE) or agent-based models, and a coherent theory for model properties such 
as stability, sensitivity and bifurcations, is mostly not present. More importantly from a practical 
point of view, methods for the model-based multi-scale data integration are hardly available. Model- 
based data integration via parameter estimation [20] is essential for the evaluation of the consistency 
of models and the underlying hypotheses with experimental findings. It enables hypothesis testing and 
generation, uncertainty quantification, as well as the prediction of processes dynamics under unknown 
conditions. Model-based data integration was maybe the most important reason why mechanistic 
pathway models were so successful in the past (see [21]). To exploit the full potential of multi-scale 
models the collection of rigorous theoretical and computational methods has to be extended. 

The field of multi-scale modelling has grown rapidly over the last years. Recent reviews focus on the 
modelling and simulation [12, 22, 23, 24]. Methodological challenges and potential solutions, e.g. in 
the context of parameter estimation for multi-scale models, are hardly addressed by the published 
reviews. As this discussion is in our opinion urgently necessary, we complement previous reviews 
by focussing on multi-scale model inference, in particular on methods for optimisation, uncertainty 
analysis and model selection. Furthermore, we discuss novel approaches which can help to advance 
inference of multi-scale models. We introduce concepts to decrease the computational complexity 
associate to multi-scale simulations, including reduced order and surrogate modelling. Furthermore, 
interesting theoretical concepts for the analysis of interconnected systems are highlighted. 

The review is structured as follows: In Section 2 common single- and multi-scale modelling approaches 
are summarised from a mathematical perspective. Furthermore, dependencies between spatial and 
temporal scales are discussed, along with a few groundbreaking contributions to the field of multi¬ 
scale modelling. In Section 3, we outline available analysis tools for multi-scale models, discuss the 
reasons for the limitations and provide suggestions for future research approaches. In Section 4 - 
the central part of this paper - we turn our attention to the data-driven modelling of multi-scale 
biological processes. We outline available optimisation and uncertainty analysis methods for model- 
based multi-scale data integration, discuss the limitations arising from computational complexity and 
outline methods which could be used to overcome them. The manuscript concludes in Section 5. 


2 Modelling 

The term “mnlti-scale models” is widely used but for many years the scientific community lacked an 
appropriate definition. One has recently been provided by the Interagency Modelling and Analysis 
Group (IMAG) [25]: 

“Multi-scale, biomedical modelling uses mathematics and computation to represent and simulate a 
physiological system at more than one biological scale. Biological scales include atomic, molecular, 
molecular complexes, sub-cellular, cellular, multi-cell systems, tissue, organ, multi-organ systems, or¬ 
ganism, population, and behaviour. These multi-scale biomedical models may also include dynamical 
processes which span multiple time and length scales. ” 

This definition is, like many others, very broad, in order to capture a complete research field. In the 
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following, we will try to provide some more details on modelling approaches for individual biological 
scales and their integration into multi-scale models. Furthermore, we will shortly discuss the relation 
of time and length scales. 

2.1 Modelling of individual biological scales and processes 

In bio- and life-sciences, many different multi-scale modelling approaches are used. This variety of 
multi-scale modelling approaches is caused by the variety of modelling approaches for the individual 
biological scales (see Figure 1 and discussion below). In the following, we outline the most common 
modelling approaches and model classes used for the study of individual spatial scales. Our under¬ 
standing of biological scales coincides with the IMAG definition for multi-scale, biomedical modelling 
but to ensure stringency some scales are lumped together. For the different modelling approaches, 
we will outline key mathematical properties. Furthermore, we summarize how these properties are 
exploited to capture the processes occurring in a particular scale. 

2.1.1 Atomic and molecular scale 

The building blocks of biological systems are deoxyribonucleic acids (DNAs), ribonucleic acids (RNAs), 
proteins, lipids, and metabolites. The structure, formation (e.g., folding) and interaction of these 
biomolecules is commonly described and analysed using quantum mechanic (QM) models and molec¬ 
ular mechanic (MM) models [26]. Quantum mechanic models provide the most detailed description of 
atomic and subatomic processes. For their simulation ab initio, density function and (semi-)empirical 
methods are available [26]. As these methods are computationally demanding, even on very short 
time-scales, only small molecules can be analysed using fully quantum mechanic models. To simulate 
larger biomolecules, molecular mechanic models are used. These models exploit a cruder description 
of molecules and describe atoms as point masses and point charges which interact via spring-like inter¬ 
actions and van der Waals forces [27]. Mathematically, molecular mechanistic models can be viewed 
as systems of agents which interact and these dynamics are captured by differential equations. To 
capture important details, e.g., in reaction centre of an enzyme, while being able to simulate large 
molecules, quantum mechanic / molecular mechanic (QM/MM) models have been developed [28]. To 
enable the study of larger biomolecules, coarse-grained and lattice models are employed. These models 
represent groups of atoms by a point mass and charge [29]. 

2.1.2 Sub-cellular and cellular scale 

The interaction of biomolecules gives rise to biological pathways and processes, such as gene regu¬ 
lation, signal transduction and metabolism. These processes have been modelled using for example 
continuous-time discrete-state Markov chains (CTMCs) [30, 31, 32], stochastic differential equations 
(SDEs) [32], ordinary differential equations (ODEs) [3], partial differential equations (PDEs) [33, 34], 
agent-based models [35], stochastic or deterministic Boolean models [36, 37, 38], Petri nets [39, 40] 
and constraint-based models [5]. Boolean and Petri net models are often referred to as qualitative 
models, as other model classes enable a more quantitative description. 

Gene regulation is mostly modelled using CTMCs [41], Boolean models [42] and Petri nets [40]. These 
models capture the discreteness and the stochasticity of the processes arising from the small copy 
number of genes. The dynamics of signalling pathways are mostly described using SDEs [43, 44] 
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and ODEs [2, 4], as large abundances of the involved biochemical species supports a continuous 
interpretation in terms of concentrations. For the assessment of the qualitative properties of signalling 
pathways, also Boolean models are used [8]. As the availability of spatial information increases, also 
PDEs, spatially resolved CTMCs and agent-based modelling approaches are more commonly used [34, 
45]. For the description of the metabolism often ODE-based formulations are exploited. Under 
the assumption of fast temporal dynamics, these ODE models can be used to derive constraint-based 
formulations which merely require information about the stoichiometry of the biochemical reactions [5] . 

2.1.3 Multi-cellular and tissue scale 

The variability of individual cells and the dependence of cellular dynamics on the current cell state, the 
history of individual cells, environmental condition and/or the spatial location in the tissue is captured 
by cell population models. The first population model has been introduced by von Foerster [46]. It 
described populations of non-interacting, proliferating cells. This groundbreaking work on structured 
population models - systems of PDEs ~ has been followed by theoretical contributions and model 
extensions (see [47, 48, 49, 50, 51, 52] and references therein). To study stochastic dynamics of 
gene regulation and signal transduction in non-dividing, heterogeneous cell populations, the Chemical 
Master Equation (CME) [53] and the Fokker-Planck equation (FPE) [54] are employed. To account for 
mixed, stochastic and deterministic sources of cell-to-cell variability, also extensions to the Chemical 
Master Equation [55, 56] and the Fokker-Planck equation [57, 58] have been introduced. 

The spatial organisation present in biological tissues is typically captured by using either agent-based 
or PDE models. The first model class uses agents to represent cells: either each cell individually (1 
agent = 1 cell) [13, 59, 60, 61, 62, 63], or coarse-grained compartments of several cells (1 agent = 
many cells) [64], or a single cell by an ensemble of agents (many agents = 1 cell) [65, 66]. The latter, 
e.g. cellular Potts models, enable a detailed description of cell-cell interaction and cell movement. 
These models allow for direct interactions of cells. An alternative to agent-based models are provided 
by cellular automata [67, 68, 69], which abstract the detailed biological interactions to rules. On the 
other hand, the second model class - PDE models - provide a more macroscopic view and describe 
the cell densities as continuum in space and capture their temporal evolution. There is significant 
work available on PDE models for avascular tumour growth [70, 71], angiogenesis [72], developmental 
processes [42, 73], bacterial biofilms [74] and many other important processes. 

2.1.4 Organ scale 

Organs consist of many different tissues and possess a complex spatial organisation, the length scales 
are however too large to model individual cells. To enable organ-scale simulation models, homogeni¬ 
sation methods have been developed. These methods originated in the field of porous media and use 
local averaging to enable PDE-based description of flow, transport and biomechanics processes in tis¬ 
sues as well as intracellular dynamics [12, 75, 76, 77, 78, 79]. The coupling of intra- and extracellular 
dynamics yields bidomain models [80]. Commonly, these bidomain PDE models are coupled with 
ODE models for the vascular transport [81, 82]. 

As organs possess different functions, their physiological characteristics are diverse. This diversity 
is also reflected by the models. Models of the heart [12, 75] and other muscles [79] focus on tissue 
mechanics while models of the lung [77, 81], the brain [78, 82], the craniocervical region [83], and solid 
tumours [76] focus on transport and delivery processes. Accordingly, the underlying mathematical 
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Figure 1: Visual summary of the types of mathematical models used to describe biological processes 
on different scales. A coloured box indicates that the types of mathematical models are routinely used 
for models of the respective spatial scale. The spatial scales are split up in subcategories to allow for 
additional insight. 
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descriptions and equation types differ. 

2.1.5 Multi-organ and organism scale 

On the organism scale, the properties of and crosstalk between organs are studied, e.g., in the context 
of drug delivery and treatment. The crosstalk between organs is mostly mediated via the vascular and 
the lymphoid system, which are used for the transport of substances between organs. Additionally, 
the nerve systems mediates information exchange between sensory organs and muscles with the brain. 

Models for the organism scale consider the body physiology [84], e.g., the volume of organs and 
large blood vessels as well as the available area for substance exchange and vessel permeability. 
These physiological parameters, which are often well accessible, allow for the parameterisation of 
multi-compartment models. The individual compartments are organs and large blood vessels. In 
pharmacokinetics (PK), ODE-based multi-compartment models are used to describe liberation, ab¬ 
sorption, distribution, metabolisation and excretion of drugs [85, 86]. These pharmacokinetic mod¬ 
els have been combined with pharmacodynamic (PD) models which describe the biochemical and 
physiological effects of drugs on the body [87, 88, 89]. The resulting physiology-based pharmacoki¬ 
netic/pharmacodynamic (PK/PD) models [14] are widely used in the pharmaceutical industry for 
drug discovery, decision making and data integration [90]. 

Most PK/PD models rely on a mathematical description in terms of DDEs. To capture fast metabolic 
processes, recently also hybrid models have been introduced which combine DDEs and constraint- 
based modelling [91]. Furthermore, in larger consortia like the Physiome Project [18] and the Virtual 
Liver Network [92, 93], PK/PD models are integrated with PDF and agent-based models for a refined 
description of the organs of interest. 

2.1.6 Population and behaviour scale 

As inter-individual variability is pervasive, personalised PK/PD models have been developed. These 
personalised models account, among others, for differences in gender, age, height, body mass, general 
health and genetic alterations [14]. On the population scale, this yields mixed-effect models [94, 
95]. These mixed-effect models often rely on an ODE-based description of individuals and provide a 
description of the population dynamics in terms of a high-dimensional system of PDFs, a Liouville- 
type equation. As the dimensionality is prohibitive for a direct simulation of the PDE, representative 
sets of ODE simulations are used to evaluate the population statistics. The corresponding results can 
nowadays be exploited in personalised medicine [14]. 

In social sciences, mathematical modelling is employed to study the interactions of individuals and 
the dynamics of groups [96, 97]. Therefore, agent-based models are frequently used as they provide 
a natural description of many processes, e.g., evacuation or traffic. Accordingly, they can be used to 
capture and to predict emergent phenomena. 

A visualisation of the modelling approaches exploited to describe individual biological scales is provided 
in Figure 1. We find that the class of modelling approaches used to capture processes on the cellular 
scale is most versatile. This is probably related to the focus of systems biology on the cellular scale. 
Furthermore, ODE and PDE models are used across scales and are the most common modelling 
approaches. Reason for this are most likely the existence of well established, deep theoretical results for 
these models [98, 99] and the availability of computationally efficient simulation methods (see [100, 101] 
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and references therein). 

2.2 Multi-scale modelling 

In the previous section various modelling approaches for biological processes have been outlined. The 
reason for the variety is the multi-scale and multi-physics nature of biological systems. To span 
different scale and processes, different models / model classes are necessary and have to be coupled. 
In the following, we will shortly outline different multi-scale modelling philosophies. We will discuss 
simulation and coupling approaches and conclude with a number of well-known multi-scale models. 
Throughout the chapter we will use the terms micro-scale and macro-scale to indicate small and large 
spatial scales respectively. 

2.2.1 Aims of multi-scale modelling 

The ultimative aim of multi-scale modelling is to describe the macroscopic behaviour of complex 
systems using microscopic models derived from first-principles. Accordingly, many scientists argue 
that: 

“The task of multi-scale modelling is to carry out macro-scale simulations without making use of ad 
hoc constitutive relations. Instead the necessary micro-scale constitutive information is extracted from 
micro-scale models.” [102] 

This would allow for a holistic analysis based on fully mechanistic descriptions. While the defined 
task is appealing, the majority of multi-scale modelling in biology follow a different philosophy. 

In biology, a variety of different physics have to be covered, ranging from molecule-molecule inter¬ 
action over signalling all the way to fluid and tissue mechanics. As a simultaneous first-principle 
modelling of these different physics is currently beyond reach, many research projects focus on the 
coupling of established macroscopic models to capture and study the multi-physics nature of biological 
systems [12]. Examples are multi-scale models for pharmacokinetic and pharmacodynamic models, 
which describe fluid transport as well as intracellular signalling [14, 15]. In a preliminary step the 
individual macroscopic models are derived from first-principles and than used in coupled multi-scale, 
multi-physics simulations. As modelling of individual processes is relatively well established, many 
systems and computational biologists currently aim at linking models of different processes. 

The two aforementioned aims 

Aim 1: Multi-scale modelling using first-principles. 

Aim 2: Multi-scale and multi-physics modelling using coupled macroscopic models. 

possess a significant overlap. As the approaches and methods developed to reach the different aims 
are however different, we will in the following discuss them separately. In Section 2.2.2 we will discuss 
simulation methods for multi-scale models, while in Section 2.2.3 we focus on coupled multi-scale, 
multi-physics models. 
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2.2.2 Multi-scale modelling using first-principles 

The multi-scale modelling using first-principles relies on the simulation of macro-scale processes using 
models with micro-scale resolution. As this is often computationally infeasible, multi-resolution meth¬ 
ods [103] and equation-free modelling approaches [104] have been developed. Multi-resolution methods, 
including adaptive mesh refinement and multi-grid methods [105, 106], are numerical schemes which 
reduce the number of micro-scale model evaluations by performing a successive (local) refinement. 
While multi-resolution methods are well established, the consideration of computationally demanding 
micro-scale models is often problematic. To circumvent long-term micro-scale simulations, equation- 
free modelling can be used. Equation-free modelling exploits on-the-fly coupling of microscopic and 
macroscopic simulations [107]. A macroscopic model/solver is used, which might require missing 
macroscopic information. This missing macroscopic information is obtained using short microscopic 
simulations. Therefore, the microscopic model is first constrained using the available macroscopic data 
(= lifting), under these constraints the microscopic model is evaluated for a short time interval (= mi¬ 
croscopic simulator), and the resulting simulation data are used to constrain the macroscopic model 
(= restriction). Several equation-free modelling approaches have been proposed, among others, het¬ 
erogeneous multi-scale methods [107], (coarse) projective integration and the gap-tooth scheme [108]. 
All these methods treat microscopic models as black boxes and merely use simulation results, which 
led to the term equation-free modelling. 

2.2.3 Multi-scale and multi-physics modelling using coupled macroscopic models 

While the available mathematical results and computational approaches for the development of first- 
principles multi-scale models are promising, application examples in biology and medicine are limited. 
In biology the modelling of complex multi-physics systems is so far mostly approached by coupling 
models which describe different aspects of the underlying biological systems. 

A nice example is the integrated physiologically-based whole-body model recently published by Schaller 
et al. [15]. This model describes the glucose-insulin-glucagon regulatory system by coupling ODE 
models for whole-body pharmacodynamics, sub-organ distribution and intracellular metabolism. The 
individual models describe completely different aspects, and unlike the first-principle models described 
in Section 2.2.2, the models do not provide different resolutions (e.g., micro- and macro-scale). Even 
the same class of mathematical models, namely ODEs, has been used to capture the different processes. 

Multi-physics modelling requires the coupling of different models. This can be challenging, even if the 
individual models describe similar processes and evolve on similar time- and length-scales [109, 110]. 
Beyond interface variables, which are discussed in Section 2.2.4, tailored simulation approaches have 
to be introduced. Walpole et al. [24] introduced the terms series simulation, parallel simulation 
and integrated simulation to provide a nomenclature for categorising simulation approaches. Series 
simulation acquire information by simulating one scale and provide this information as input to the 
next level (without feedback). Parallel and integrated simulation require communication (of different 
intensities) between simulations. 

Parallel and integrated simulations are often computationally demanding. To circumvent this problem, 
approximations based on spatial homogenisation and time-scale separation have been introduced. The 
underlying assumption of spatial homogenisation is that in a particular volume element the process 
with the large length scale is roughly constant. As the process on the large length scale is constant 
over this volume element, a single evaluation of the small length scale process could be used to analyse 
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the respective volume [76]. On the tissue level, the homogenisation would basically assume that the 
concentration of a substance in a particular region of an organ is constant. Although this region 
might be occupied by many cells, as all cells perceive the same or at least similar environment, it 
is appropriate to assume that all cells behave similarly and merely a representative cell is evaluated. 
Using this and related homogenisation approaches, the effective resolution required on the smaller 
scales can be reduced, which decreases the overall complexity of the simulation [77, 111]. An alternative 
to spatial homogenisation is provided by the adaptive tabulation multi-scale approach [112, 113]. This 
method exploits that different simulations with similar input sequences and model parameters are 
required to reduce the computational complexity. 

In addition to the spatial properties, large differences in time-scales of different processes can be 
used to simplify simulations. This has already been exploited by Michaelis & Menten [114] for the 
study of enzymatic reactions as well as in many other fields, such as stochastic simulations [115]. 
In the last decades these ideas have been extended to multi-scale processes. Multi-scale numerical 
schemes nowadays allow for the efficient simulation of coupled ODE-PDE systems [80]. Eurthermore, 
in agent-based modelling time-resolved simulation of diffusion is circumvented based on time-scale 
separation [62, 63]. 

Remark: Biological processes span many spatial and temporal scales. Intuitively, one might assume 
that microscopic processes are faster than macroscopic processes and, indeed, we find a correlation 
(Eigure 2). However, if we are interested in a particular biological process this does not have to 
hold. The example of anti-cancer treatment, involving drug distribution on the organism scale and 
apoptosis/necrosis induction on the cellular scale, shows that microscopic processes can be slower than 
macroscopic processes. This has to be appreciated during model and algorithm development. 

2.2.4 Coupling of models and interface variables 

A key ingredient of multi-scale and multi-physics models developed to achieve Aim 1 and 2 is the 
coupling of models. Approaches to couple individual scales are as diverse as modelling approaches for 
individual biological scales. However, despite differences, an essential element of any coupling scheme 
is the definition of interface variables. These are properties of microscopic models which influence 
the macroscopic model or vice versa. We propose to distinguish two classes of interface variables 
and physical coupling types: input-output coupling and direct coupling. In input-output coupling 
each interface variables depends either on the state of the microscopic system or on the state of the 
macroscopic system (Eigure 3, left), but not on both. The interface variables are therefore inputs 
and outputs for the individual scales. In direct coupling, interface variables are states shared between 
microscopic or macroscopic model (Eigure 3, right). 

Examples for interface variables on the cellular and tissue scale are the instantaneous secretion rate 
of a molecule from a cell which enters as source term in the spatial simulation of the tissue, and the 
tissue level concentration of a diffusive ligand which is bound to individual cells. The instantaneous 
secretion rate depends only on the current cellular state. Hence, instantaneous secretion rates provide 
an input-output coupling. The ligand concentration is directly influenced by tissue-level diffusion and 
cellular binding. Accordingly, the state is shared between microscopic or macroscopic model, which 
establishes an overlap of the models and a direct coupling. This direct coupling is often captured by 
models for the interfaces. 

Models only containing input-output couplings are generally easier to simulate and to analyse as 
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Figure 2: Relation of spatial and temporal scales. For a variety of biological processes the spatial and 
temporal scales are roughly indicated. The list is not comprehensive, but we find a clear correlation 
of spatial and temporal scale. 


modularity is ensured [116]. The overall problem can be decomposed in subproblems for which tailored 
methods might be available [117]. Even toolboxes for generic (PDE) problems are available for input- 
output type coupling [118]. 

Multi-scale models including one or more direct couplings require integrated simulation methods. Such 
methods have been, for instance, developed for coupled quantum mechanic and molecular mechanic 
simulations [28]. In quantum mechanic / molecular mechanic (QM-MM) models, e.g., for enzymatic 
reactions, the active centres are resolved using quantum mechanic models, while the rest of the po¬ 
tentially large molecule is formulated using molecular mechanic models. Eor the combined model a 
joint energy function is derived which can be used for simulation. 

Remark: The coupling type depends on the choice of interface variables. Hence, (potential) degrees 
of freedom can be used to render analysis and simulation more tractable. 

2.2.5 Case studies 

The methods introduced in the previous sections have been used to develop multi-scale models for 
broad spectrum of biological processes. In this section, we will briefly present five well-known models 
and discuss the structure of the respective models. A visual summary of the case studies is depicted 
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Figure 3: Illustration of input-output coupling and direct coupling. Input-output coupling provides 
an instantaneous interaction, while in direct coupling the interface possesses dynamics and is shared 
between micro- and macro-scale models. 

in Figure 4. 

Whole-heart model: The heart was among the first organs for which holistic models were developed [12]. 
As part of the Physiome Project, bidomain models - coupled ODE-PDE models - for cardiac elec¬ 
tromechanics and cardiac electrophysiology have been developed [119, 120, 121]. Based on 3D imaging 
data, molecular data and electrocardiograms, these whole-heart models account for tissue deformation 
and heterogeneities, transport processes, as well as intracellular signalling. The different macroscopic 
models are mostly linked using input-output coupling and integrated simulations are employed. A 
transfer of these models into clinical practice, e.g., to improve diagnosis and risk assessment, seems 
feasible. Ischemic lesions have been detected in individual patients using model-based multi-scale data 
integration, and effect of drugs on pump functions can be predicted [121]. 

Cancer growth model: As cancers are among the most common causes of death, the field of inte¬ 
grative mathematical oncology developed. This field seeks to develop multi-scale models for cancer 
progression and the delineation of key cancer mechanisms [13]. To achieve this aim, hybrid discrete- 
continuum models describing discrete individual cells and continuous distributions of nutritions and 
signalling molecules are used. Single-cell dynamics are governed by stochastic or deterministic models, 
while cell-cell interaction is captured using agent-based on-lattice/lattice-based or off-lattice/lattice- 
free models [62] . The properties of this first-principle multi-scale model are evaluated using integrated 
simulations. Parameterised multi-scale models already contributed to the understanding of complex 
spatial phenomena, such as fingering [13] , and will probably play a key role in the development of per¬ 
sonalised treatment strategies. Key advantages of these model-based approaches are that the impact 
of genetic alterations on cellular phenotypes and ultimately on the cancer progression and treatment 
response might be predicted, and that patient-specific data about molecule characteristics, tumor form 
and stage can be accounted for [122]. 

Glucose-insulin-glucagon regulation model: Our understanding of diabetes - one of the world’s leading 
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health problems - has recently been improved using a multi-scale model for glucose-insulin-glucagon 
regulation. Using a high-dimensional system of ODEs, the pharmacokinetics and pharmacodynamics 
of glucose, insulin, and glucagon are modelled and coupled [15]. This ODE system has been obtained 
by interface coupling an multi-organ model and sub cellular model. Using multi-scale experimental 
data, this model has been parameterised and adapted to individual patients. The resulting patient- 
specific models provided accurate predictions for glucose, insulin and glucagon dynamics after food 
intake. Model-guided development of novel diabetes treatment strategies, hence, becomes feasible. 

Liver lobule model: A variety of models improved our understanding of tissue scale processes. Models 
for liver lobules, the smallest organisational units of the liver, are particularly interesting examples [63]. 
Using high-resolution 3D imaging, realistic hybrid discrete-continuum approaches (see explanation 
above) of liver lobules have been developed. These 3D models facilitated an accurate description of 
toxihcation and repopulation processes. Predictions about the cell alignment made using these models 
could be conhrmed experimentally and enabled and improved understanding of cell-cell interaction. 

Whole-cell model: A groundbreaking contribution to cell biology has been made by the whole-cell 
model for the human pathogen Mycoplasma genitalium [123]. This model captures all essential cel¬ 
lular processes, including gene regulation, metabolism, signalling and cell division. The individual 
processes are described using Boolean and probabilistic models, constraint-based models and ODEs. 
Eor numerical simulation an integrated simulation approach has been employed, which enables the 
assessment of single-cell and population dynamics. As a key feature the whole-cell model could predict 
transciptomics, proteomics and metabolomics data as well as the effect of gene knockouts with a sur¬ 
prisingly high accuracy [123]. This confirmed that large-scale models derived from already established 
multi-scale data sources can facilitate biological discovery. 

In summary, biological processes are versatile and so are modelling approaches. There is no right or 
wrong, but it depends on the purpose of the model (and the personal preferences of the modeller). 
Multi-scale models are obtained by coupling models for different processes hierarchically or in parallel. 
The resulting models provide hrst-principle formulations for macro-scale processes or simply account 
for different physical processes. In both cases, a more holistic description of biological processes is 
achieved. In particular if its mathematical description is rigorously analysed and is parameters reli¬ 
able, novel insights can be gained. The rigorous analysis of multi-scale models as well as the estimation 
of their parameters from experimental data requires sophisticated methods. 

3 Analysis and simulation 

The mathematical modelling of biological processes requires (and facilitates) a rigorous formulation 
of hypotheses, the hrst steps towards mechanistic understanding. The derived models are, in general, 
studied using analytical and numerical methods to gain deeper insights. Of particular interest are 
often 


• existence and uniqueness of solutions, 

• sensitivity of solutions with respect to parameters, 

• local and global stability of steady states, 

• bifurcation structures of steady states, and 

• dynamic properties. 
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Figure 4: Selection of well-known multi-scale models. Biological scales captured by the model, math¬ 
ematical modelling approaches, topics and keywords, as well as a reference to an overview paper is 
provided. 


The analysis of these properties can already for individual model classes be demanding. While standard 
simulation frameworks for most model classes are available [14, 33, 124, 125, 126], for stochastic models 
even important tasks, such as sensitivity analysis [127, 128], pose challenges. This limits the study 
of the models and might explain the frequent usage of ODE and PDE models, for which a broad 
spectrum of methods is available, across spatial and temporal scales (Eigure 1). An illustration of the 
availability and sophistication of methods for different model classes is provided by Figure 5. 

In Section 2.2.5, we reviewed multi-scale modelling approaches and presented a few well-known applica¬ 
tion examples. Interestingly, a detailed evaluation of these examples revealed that for most multi-scale 
systems merely simulation studies are performed. A coherent theory, in particular for coupled multi¬ 
physics models, is often missing. Methods to study existence and uniqueness, sensitivity, local and 
global stability and bifurcation structures are urgently needed to gain a more profound understanding 
of multi-scale processes. Fortunately, some brave researchers work on such methods. Among others, 
new theoretical methods for existence and uniqueness analysis [74, 129, 130] and stability analysis [131] 
for multi-scale ODE-PDE models have been established. However, despite significant progress, the 
spectrum of models which can be analysed rigorously is still rather limited. This might be related to 
the following three challenges: 

Challenge 1.1: The applicability of established methods has not been extended to coupled multi-scale 

















15 


DATA-DRIVEN MODELLING OE BIOLOGICAL MULTI-SCALE PROCESSES 


Simulation (tools) 


Sensitivity analysis 


Optimization 


Identification and uncertainty analysis 


Model selection 


5 E 


= -a 


o 

u 


0) S 



•2 o 
c > 
o o 
u ^ 


c w 

'.s (A 

(0 

0) LU 

C LU 

LU 


0) Q 

« O 



fc 

0) ' 

"O (A 

:= (A 

(A 


^ C 

S'.2 

^ § 

(§ ™ 

£ 3 

JS « 

.£ 3 

■it 

O O- 

■C O' 

to O' 

o 0 ) 

o ® 

^ (U 


(A 


W 

0) 

•D 

O 


E 


(0 


<u 

u> 

(0 



Problem: 



solved 


open 


Figure 5: Visual summary of the development status of methods available for different model classes. 
The transparency indicates the degree to which the different tools and methods are established, respec¬ 
tively, for which model size they are applicable. Problems for which scalable, standardised approaches 
are available are considered to be “solved”. The colouring indicates trends but a quantitative inter¬ 
pretation is not intended. For a detailed discussion see Section 3 and 4.1. 


models. 

The list of methods for which this holds is extensive. Prominent examples are methods which exploit 
properties of the individual (decoupled) models to study the properties of the coupled system, e.g. 
the small-gain theorem [132]. In systems and control theory the small-gain theorem is used to prove 
stability of coupled ODE models. Although the underlying idea is simple, no general extensions to 
coupled ODE-PDE systems are available. 

Challenge 1.2: Directions from which the analysis of multi-scale models should he approached are 
often not obvious. 

Agent-based models for tumour growth [62] or tissue remodelling [63] for instance can either be in¬ 
terpreted as stochastic PDEs with complex noise term or as coupled Markov jump processes. These 
different perspectives require different mathematical and computational tools. Eor a satisfactory anal¬ 
ysis probably combinations of the existing tools and theoretical results are necessary and have to be 
established. 

Challenge 1.3: Classical concepts and tools are often not applicable to novel classes of multi-scale 
models. 

Properties can easily loose their meaning if alternative model classes are considered. While stability 
is doubtless one of the most important properties of deterministic systems, even for simple stochastic 
processes stability is not easily define [133]. Eor stochastic systems properties such as uni- or multi¬ 
modality of the instantaneous probability density [134] or statistics of noise-induced oscillation [135] 
are often more essential. As models and model classes change, the properties of interest have to be 
adapted and generalised. 

Despite the multilayer nature of these challenges, there is reason for hope. Walpole et al. [24] outlined 
in a recent review that most multi-scale models capture only a small subset of the scales. Accordingly, 
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not all combinations of model classes have to be considered, but already a few combinations might be 
sufficient. As coupled ODE-PDE systems and coupled CTMCs-PDE systems are widely used, these 
classes of multi-scale models might be appropriate starting points. 

In addition to analytical results, equation-free stability, sensitivity and bifurcation analysis methods 
have been developed [136, 137]. These methods allow for the simulation-based analysis of quantitative 
and qualitative properties of multi-scale methods. While the use of simulations allows for a certain 
independence from the underlying equations, the detailed mathematical understanding is lost. Eur- 
thermore, instead of analysing the mathematical model, often the numerical implementation is studied. 
To circumvent this, the simulation uncertainty can be quantified and taken into account [138, 139]. 

In summary, the scarcity of the available methods limits the analysis of multi-scale models. While we 
cannot provide a comprehensive overview here, the need for the development of novel tools is apparent. 
One possibility might be to exploit modularity and merely rely on the input-output characteristics of 
individual models. Although potentially conservative, methods relying on this idea might be widely 
applicable. 


4 Parameter estimation and model selection 

Mathematical modelling allows for the description of biological processes and the study / prediction of 
their properties. However, reliable quantitative results require accurate model structures and model 
parameters (e.g., reaction rates and diffusion coefficients). Due to experimental constraints, structures 
and parameters of biological systems can often not be assessed directly. Instead, they have to be 
inferred from limited, noise-corrupted data. Therefore, parameter estimation and model selection 
methods are required. 

Parameter estimation and model selection are means to integrate data from different data sources. 
This becomes increasingly important as the availability and variety of data increases continuously, 
due to new technologies. It has been shown that already model-based data integration on individual 
scale can significantly improve our biological understanding. As multi-scale models are significantly 
more complex, a further improvement has to be expected. Eurthermore, information on one scale can 
help to understand other scales. 

4.1 Challenges for data-driven multi-scale modelling 

Multi-scale models are obtained by coupling models for individual biological scale (Section 2.2.2 
and 2.2.3). Accordingly, the most naive approach is to perform parameter estimation and model 
selection for the individual scales separately. As parameter estimation and model selection are fields 
of active research since decades (see [20] and references therein), a number of breakthroughs have been 
made for individual model classes: 

• Boolean and constraint-based models: Boolean and constraint-based models are parameter- 
free and neither parameter estimation nor uncertainty analysis methods are required. For the 
reconstruction of Boolean models from experimental data optimisation-based methods are com¬ 
monly used. Their computational complexity scales reasonably with the network size and they 
are applicable to networks of several hundred components [140]. Constraint-based models are 
mostly derived from (literature) knowledge about the biochemistry of the underlying process. If 
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sufficient data are available also statistical approaches, such as Gaussian Graphical Models, can 
be used [141, 142], 

• ODEs: For ODE models robust and scalable inference methods are available [143, 144, 145]. 
Optimisation [146] and uncertainty analysis [147] of ODE models with a few hundred parameters 
is feasible on standard computer infrastructures. The methods are established for a wide range 
of applications and implemented in several software packages (see [146, 148] and references 
therein). The results allow for model selection and optimal experimental design. Model selection 
can become time consuming if many model alternatives have to be tested, and the genom-scale 
models with thousands of state variables and parameters are currently beyond reach [122, 149]. 

• PDEs: Inference for PDE models [150, 151] is generally more challenging than for ODE mod¬ 
els. Nonetheless, rigorous optimisation and uncertainty analysis of low- and medium-dimensional 
PDE systems with dozens of parameters is feasible [152, 153, 154, 155]. Based on existing simula¬ 
tion environments [156], toolboxes for PDE-constrained optimisation are being introduced [157] 
and are expected to replace problem-specific implementations. 

• SDEs and CTMCs: Standard Bayesian and frequentist approaches to parameter estima¬ 
tion consider the conditional probability of the data given the parameters, commonly known 
as the likelihood [20]. For stochastic models, such as CTMGs and SDEs, evaluating the like¬ 
lihood requires marginalisation over all possible paths of the stochastic process. This is often 
computationally intractable and limits the applicability of optimisation methods. Approxima¬ 
tions to likelihood functions [44] as well as likelihood-free estimation methods have been intro¬ 
duced [158, 159]. In particular. Approximate Bayesian Computing (ABC) methods are nowadays 
widely used [160] . These flexible methods enable the sampling of the posterior distributions, un¬ 
certainty and model selection, without evaluation of the likelihood function. Due to the large 
number of simulations, mosty problems with rather low and medium computational complexity 
have so far been studied [160, 161, 162]. For certain subclasses of models also software packages 
are available [161]. 

• Agent-based models: In agent-based modelling often interacting and non-interacting agents 
are distinguished. Models consisting of non-interacting agents are widely used in pharmacokinet¬ 
ics and pharmacodynamics, and are - as mentioned above - also denoted as mixed-effect models. 
For mixed-effect models standard inference methods are available [94, 163] and implemented in 
free and commercially available software packages [164]. 

Models of interacting agents are more difficult to parameterise than models of non-interacting 
agents. The reason is the need for integrated simulations of the complete system. The resulting 
computational complexity is challenging and mainly problem-specific solutions are currently em¬ 
ployed [165]. One reason is the variety of experimental data and their corresponding objective 
function types. 

The integration of data-driven models for individual scales to multi-scale models is reasonable and 
rather tractable. Therefore, this approach has been successfully applied in the literature several 
times [19, 166]. However, this approach suffers also from several limitations. It is clear that interface 
variables have to be consistent across scales. This cannot be guaranteed if model htting is done 
separately. Furthermore, for multi-scale models with direct coupling a separate data-driven modelling 
might not even be feasible. Beyond these technical aspects, experimental data used for the data-driven 
modelling of the individual scales might have been collected under different conditions. Accordingly, 
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the models might not be valid for the regimes required in a multi-scale model. Extensive studies of 
the transferability of models are currently not available. 

The aforementioned limitations establish the need for an integrated approach for data-driven multi¬ 
scale modelling. Beyond the data-driven modelling of individual scales, an important first step (if 
feasible), the multi-scale model has to be fitted to / validated with multi-scale data [24] . Examples for 
this have been provided in the context of blood cell mechanics [167] and heart electromechanics [121]. 

Parameter estimation, uncertainty analysis and model selection for the multi-scale models is apparently 
more challenging than data-driven modelling of the individual scales. The dimension of the parameter 
space increases and the handling of the models becomes more demanding. Key challenges are in 
particular: 

Challenge 2.1: Computational complexity for multi-scale simulation (and sensitivity analysis) is often 
large. 

Challenge 2.2: Stochasticity of many multi-scale models establishes the need for a large number of 
simulations. 

Challenge 2.3: Reproducibility of parameter estimation and model selection results have to be ensured. 
Challenge 2.4: Lack of tailored and generic computational tools for data-driven multi-scale modelling. 
Some of these challenges are inherited from models of particular scales. 

In the following, we will discuss novel approaches and ideas which could in the future be used to 
address these challenges. Thereby, we will provide a comprehensive list of potential methods helpful 
for data-driven multi-scale models. 

4.2 Optimisation 

To determine unknown model parameters, statistically motivated measures for the goodness of fit are 
used. These objective functions are optimised with respect to the parameters. For dynamical systems, 
the respective optimisation problems are in general nonlinear and non-convex. Hence, sophisticated 
optimisation schemes are required. Common tools employ multi-start local optimisation [146], mul¬ 
tiple shooting [143, 168], evolutionary algorithms [169, 170], pattern search [171] and particle swarm 
optimisation [171, 172]. Banga [144] and Weise [173] provide comprehensive surveys of local and global 
optimisation procedures. 

In the course of the optimisation, the goodness of fit is assessed at different points in parameter 
space. This requires the simulation of the model and ~ depending on the optimisation method - 
the evaluation of sensitivities (derivatives of the model output with respect to the parameters). The 
repeated simulation and sensitivity evaluation can be time consuming (Challenge 2.1), stochasticity 
(Challenge 2.2) and reproducibility (Challenge 2.3) are however even more intricate. 

Parameter optimisation for a number of multi-scale models has been approached in recent years. In 
particular for PK/PD models great successes have been reported (see, e.g., [15]). Here, different 
ODE models are coupled and simulation as well as parameter estimation remains efficient. The 
optimisation-based integration of experimental data collected on different scales provided novel insights 
in diseases, such as diabetes. Promising results have been obtained for mixed-effect models describing 
population and single-cell level [174], but reproducibility is often an issue. For coupled PDF models 
novel optimisation methods resulted in great successes, e.g., in the context of the Virtual Heart [121]. 
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For models composed of different model types, optimisation seems more challenging. This is also 
indicated by the aforediscussed contributions (Section 2.2.5). Hoehme et al. [63] introduced and 
validated a sophisticated agent-based model for liver regeneration. The available measurement data 
were however merely used to determine realistic simulation domains and rough parameter values. A 
parameter optimisation was not approached. The same is true for the whole-cell model developed by 
Karr et al. [123]. Reasons for this are stochasticity and computational complexity of model simulations. 

There are a series of promising mathematical methods which tackle Challenges 2.1 - 2.3. These meth¬ 
ods split the underlying optimization problem in smaller subproblems or decrease the computational 
complexity associated to model evaluations. We will shortly outline the key ideas underlying these 
methods. A visual summary is depicted in Figure 6. 

4.2.1 Decoupling using dependent inpnt approach 

The parameter estimation for models with high-dimensional parameter and state space is often chal¬ 
lenging. An intuitive idea is therefore to exploit the modularity of biological systems. The overall 
system can be decomposed into a set of interconnected subsystems [175]. The subsystems, for instance, 
describe individual biological scales or span different scales. A subsystems is interfaced with other 
subsystems. The dependent input approach regards these other subsystems as unmodelled dynamics 
and replaces them by fictitious ‘dependent inputs’ [176]. This enables the decoupling of subsystems, 
assuming that the dependent inputs can be measured in experiments. The subsystems can then be 
fitted separately, assuming that they do not share parameters. This eliminates the need for multi¬ 
scale simulations and addresses Challenge 2.1. Furthermore, it provides a tailored tool for data-driven 
multi-scale modelling (Challenge 2.4). 

The dependent input approach is rather flexible and in principle not limited to a particular class of 
models. It however requires that the dependent inputs are measured. This limits the decomposition 
and renders it also measurement dependent. Furthermore, subsystems require continuous input signals 
while measurement data are mostly collected at discrete time points. Interpolation and filtering 
approaches can be used to close these gaps [177], this can however be error-prone. More sophisticated 
approaches fit input data and subsystems dynamics simultaneously [178, 179]. 

The dependent input approach is widely used for optimisation [178, 179] as well as uncertainty analy¬ 
sis [180]. Open questions are mainly related to the securing of consistency between input and output 
signals of individual subsystems. 

4.2.2 Reduced order modelling 

The computational effort associated with numerical simulations is frequently the bottleneck for op¬ 
timisation. Model order reduction methods reduce the complexity of high-dimensional model, while 
preserving their input-output behaviour as much as possible [181]. The resulting reduced models, 
which can be simulated more efficiently, mimic the behaviour of the full model. 

For linear models efficient and reliable model order reduction methods are available [182], for instance, 
SVD-based [183] and Krylov subspace methods [184]. In the last decade these methods have been 
extended to linear [185, 186, 187, 188] and non-linear [189, 190] parametric (partial) differential 
equations. This enabled the use of reduced order models in optimisation [191]. Simulations of the 
full model are simply substituted by simulations of the reduced order model. To account for the 
approximation error, a-posteriori error bounds can be used [191, 192, 193]. 
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In multi-scale modelling, reduced order models are easily used to decrease the computational complex¬ 
ity of models for individual scales or processes. Furthermore, promising multi-scale model reduction 
methods have been developed which consider several scales simultaneously [194, 195]. Using these ap¬ 
proaches a speed-up of the numerical calculations by several orders of magnitude has been achieved, 
rendering parameter estimations feasible by addressing Challenge 2.1. 

4.2.3 Surrogate modelling 

Model order reduction methods exploit the structure of the governing equation. This limits the appli¬ 
cability of these methods - as the equations might become high-dimensional or highly non-linear - and 
led to the development of surrogate modelling approaches. Surrogate models, also known as meta¬ 
models, response surface models and emulators, are scalable analytical models for the approximation 
of the multivariate input-output behaviour of complex systems. Surrogate models are derived from 
simulated input-output data and do not consider the internal structure of the original model. Popular 
surrogate models are polynomial response surfaces [196], Kriging [197], space mapping [198], support 
vector machines and radial basis function approximations [199]. Similar to reduced order modelling, 
surrogate modelling can be used to reduce the computational complexity of a single model evaluation 
(Challenge 2.1). 

Surrogate models have been used to approximate objective functions [199] as well as time-courses 
of (multi-scale) models [197, 198, 200]. Both types of surrogates have been used in surrogate-based 
optimisation. Surrogate-based optimisation methods circumvent evaluation of the computationally 
demanding model by evaluating the surrogate model. Based on a first space filling sampling, e.g., latin 
hypercube sampling, an initial surrogate model is derived (Step 1). This surrogate model is optimised 
(Step 2). At the new optimal point the full model is evaluated (Step 3) and the surrogate model is 
updated (Step 4) before returning to Step 2. The individual steps in surrogate-based optimisation 
are involved and there exist a variety of different approaches. For details we refer to [199, 201] and 
references therein. 

4.2.4 Multi-level Monte-Carlo methods 

For stochastic models a single evaluation of the objective function requires the averaging over stochastic 
simulations. This averaging can be computationally demanding because it normally requires many 
simulations. To decrease the necessary number of simulations and to address Challenge 2.2, multi-level 
Monte-Carlo methods have been developed [202]. Multi-level methods employ series of increasingly 
complex models. For SDEs, this series of models could be, for instance, different numerical SDE 
solvers with increasing accuracy [203]. Instead of estimating the mean behaviour of the full model 
using Monte-Carlo integration, e.g., a numerical SDE solver with a very high accuracy, the difference of 
adjacent models in the series is assessed. As this difference is smaller, fewer evaluations are necessary, 
often resulting in a speed up and a lowered variance of estimates [202]. Similar approaches have 
recently been introduced for CTMCs [204]. 

Multi-level Monte-Carlo methods often use approximate stochastic simulations. These methods on 
their own can already accelerate simulations. The simulation of CTMCs is, for instance, often approx¬ 
imated using tau-leaping [205], time-scale separation [206] or diffusion approximation [44]. 

Monte-Carlo integration induces stochasticity of the objective function. Two evaluations of the ob¬ 
jective function for the same parameter will provide slightly different results. This is a severe issue for 
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the commonly used deterministic optimisers, and therefore stochastic optimisation procedures have 
to be employed to ensure robustness [173], e.g., implicit filtering [207]. Such stochastic optimisers use 
involved update schemes, and tailoring of these schemes to the structure of the considered multi-scale 
model might be beneficial. In particular, block updates might improve the search performance [208]. 
Block update schemes, currently used in Markov-chain Monte-Carlo methods, can exploit the model 
structure to virtually reduce the problem size by updating merely parameters belonging to the same 
scale or the same process. 

4.2.5 Moment equations and system-size expansions 

Alternatives to Monte-Carlo integrations are provided by moment equations [209] and system-size 
expansions [210]. Moment equations and system-size expansions are deterministic models describing 
the statistics - mean, variance and higher-order moments - of the solutions of stochastic processes, 
i.e., CTMCs, which eliminates the need for repeated stochastic simulations (Challenge 2.2). A further 
advantages is that moment equations and system-size expansions are deterministic models which allow 
for the use efficient deterministic optimisers. For these optimizers the reproducibility is general good, 
addressing Challenge 2.3. The key disadvantage of moment equations and system-size expansions 
is that they merely provide approximations, as moment closure and/or truncations of infinite series 
are required [211]. Detailed studies of the influence of the approximation error on the parameter 
estimation are so far missing. To minimise the effects, hybrid methods have been developed, which 
use a fully stochastic description for low-copy number species and a moment-based description for 
high-copy number species [134, 212, 213, 214]. The resulting hybrid models are often more accurate, 
but their simulation is computationally also more demanding. 

Moment equations are available for concentrated as well as distributed processes. Already in the 1990s, 
spatial moment equations have been developed for individual-based models [215, 216]. These equa¬ 
tions have been successfully used to study population dynamics on regular lattices, irregular networks, 
and continuous spatial domains. Different closure schemes have been developed to provide appropri¬ 
ate approximation accuracies [217]. The consideration of long-range correlations, which improves the 
approximation accuracy but also increase the computation time, turned out to be particular critical. 


4.3 Identifiability and uncertainty analysis 

As experimental data are limited and noise-corrupted, parameter estimation has to be complemented 
by structural and practical identifiability. Structural identifiability is concerned with the structure 
of the system [218] and can be used to assess a priori whether the parameters can in principle be 
determined from noise-free data. In contrast, practical identifiability is related to a particular dataset 
and its information content [219]. Using practical identifiability analysis the parameter and prediction 
uncertainties are quantified in terms of asymptotic and finite-sample confidence intervals. 

Asymptotic confidence intervals can be computed via local sensitivity-based methods, e.g., the Wald 
approximation [220] or the Fisher information matrix (FIM) [221]. While these asymptotic confidence 
intervals are already determined by standard optimisers, these confidence intervals are not reliable. 
Therefore, finite sample confidence intervals derived using bootstrapping [222], profile likelihoods [219] 
and Markov chain Monte-Carlo methods [223] are preferred in practice. A recent study showed 
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Figure 6: Visual summary of methods which could potentially improve optimisation of multi-scale 
models; (A) decoupling using dependent input approach; (B) reduced order modelling; (C) surrogate 
modelling; and (D) computing solution statistics for stochastic processes using multi-level Monte- 
Carlo methods &: moment equations and system-size expansion). 


however, that also the use of bootstrapping-based confidence intervals are problematic [224], While 
profile likelihoods and bootstrapping require a large number of local or global optimisations, Bayesian 
methods exploit sampling of posterior distributions using, e.g., Markov chain Monte-Carlo methods. 
For multi-scale models using a coherent modelling of the scales, this is often feasible using current 
methods [15], while for most multi-scale models this is so far intractable. 

These limitations can be partially overcome using aforementioned reduced order and surrogate models, 
as well as, multi-level Monte-Carlo methods and moments equations. In addition, tailored and efficient 
identifiability and uncertainty analysis methods are being developed. A few of those will be introduced 
in the following. 

4.3.1 Structural identifiability analysis for interconnected systems 

Identifiability analysis uses the governing equations of dynamical systems to study whether parameters 
can in principle be determined [218]. Therefore, several approaches based on Taylor and generating 
series, implicit function theorem and differential algebra have been developed [225]. These methods are 
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applicable to ODE and PDE models. The key advantage of these methods is that they exploit symbolic 
calculations and do not rely on numerics. This makes them robust, but also limits their application to 
small- and medium-size systems. To overcome these challenges, approaches for interconnected systems 
have been developed [226, 227]. These methods exploit the modular structure of interconnected / 
large-scale systems and the input-output structure of subsystems to decide about global structural 
identifiability. Accordingly, these methods are well-suited for the study of multi-scale models and 
address Challenge 2.4. 

4.3.2 Simulation-based profile likelihood calculation 

In addition to the development of novel methods designed for multi-scale models, the efficiency of 
existing methods is continuously increasing. This allows for a more rigorous assessment of problems 
at hand and improves the reproducibility of results (Challenges 2.3). Among others, fast simulation- 
based profile likelihood calculations methods have been proposed [228]. These methods circumvent 
the repeated local optimization by formulating dynamical systems evolving along the profiles. Using 
gradient and hessian of the objective function, the update direction for the parameters is determined 
directly. The resulting path in parameter space describes the prohle likelihoods. First (unpublished) 
results indicate a significant acceleration. 

4.3.3 EfRcieut sampling methods 

To improve the efficiency of Bayesian uncertainty analysis, structure exploiting sampling schemes have 
been introduced. In particular, Riemann manifold and Hamiltonian Monte-Carlo methods provide a 
performance boost for a variety of applications [229, 230]. These methods use gradient and hessian of 
the objective function to construct a local approximation. By sampling from this local approximation, 
a significantly reduction of the autocorrelation of the Markov chain samples can be achieved compared 
to standard Markov chain Monte-Carlo methods. Similar efficiency increases could also be achieved 
using adaptive single- and multi-chain Monte-Carlo methods [147, 231, 232]. These adaptive samplers 
use the available sample path to construct an approximation of the local structure. Accordingly, the 
potentially demanding evaluation of gradient and hessian is circumvented. 

Samplers can also be combined with surrogates, e.g., radial basis function approximation of the pos¬ 
terior distribution [233]. Furthermore, surrogate-based sampling schemes have been developed [234]. 
For computationally intensive problems these approaches can outperform standard methods. This 
natural extension of surrogate-based optimisations should be explored in the future, in particular, as 
information collected during optimisation can be reused. 

4.3.4 Approximate Bayesian Computing 

The evaluation of likelihood functions, the statistical distance of model and data, is for many models 
difficult. Therefore, likelihood-free methods have been proposed [158], also known as Approximate 
Bayesian Computing (ABC) methods. ABC methods circumvent the evaluation of the likelihood 
function using model simulations [160] and thereby address Challenge 2.2. A simulation for a particular 
parameter value is accepted or rejected based on simple distance measures. For appropriate choices of 
the distance measure, ABC methods sample approximately from the Bayesian posterior distribution. 
These samples enable a direct assessment of parameter and predictions uncertainties. 
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To ensure efficiency and reliability of ABC methods, a variety of sophisticated sampling schemes 
have been developed. Popular are ABC Markov chain Monte-Carlo and ABC sequential Monte-Carlo 
methods [235]. For models with time-consuming simulations also surrogate-based approaches have 
been developed, i.e., approximate ABC (AABC) [236]. 

4.4 Model selection 

In biology not only model parameters but also model structures are often unknown. Hence, subse¬ 
quent to parameter estimation, model selection has to be performed to evaluate competing hypotheses. 
Therefore, a set of alternative models is defined and the best model is selected using likelihood ra¬ 
tio [237], Akaike information criterion [238], Bayesian information criterion [239], Bayes factors [240] or 
flavours of these criteria. If the set of alternative models is too large, forward and backward selection 
methods can be used to avoid a full enumeration and exploration. The aim of model selection is to 
determine the underlying biological and biochemical mechanism and to avoid over- and underfitting. 

Model selection requires parameter estimation for all model alternatives. An acceleration of parameter 
estimation methods, there also results in an accelerated model selection. In addition, more sophis¬ 
ticated model selection methods are being developed, e.g., more reliable algorithms for evaluating 
Bayes factors [241]. This is also important, as detailed studies of ABC-based model selection methods 
revealed. For stochastic models ABC-based model selection is the method of choice [235], however, 
if insufficient statistics (= error norms) are used, Bayes factors can select incorrect models [242]. 
Sufficiency conditions have been derived to avoid this problem [243]. 

Case studies showed that the combination of data collected on different levels can enhance the model 
selection. In a recent paper, it has been shown that model selection using gene expression and 
metabolite data allows for an improved inference of regulation mechanism compared to the individual 
models/datasets [244]. The reason for this improvement is that information about adjacent levels 
provide additional constraints for the model. Hence, model selection on individual scales has to be 
complemented by multi-scale model selection. In our opinion appropriate methods to tackle this prob¬ 
lem are currently not available. Multi-scale models are mostly selected based on heuristic arguments 
and visual inspection. 

In summary, this section provided an overview about state-of-the-art methods for data-driven multi¬ 
scale modelling. Challenges have been outlined and novel methods and ideas have been discussed. The 
discussed methods and ideas might revolutionise multi-scale data integration by shifting the bounds 
and allowing for much higher-dimensional problems. 


5 Conclusions and outlook 

A multitude of modelling approaches is used to describe biological processes and to unravel the under¬ 
lying working principles. This review provides an overview over the spectrum of different (multi-scale) 
modelling concepts, thereby not aiming for complete comprehensiveness. Qualitative and quantitative 
modelling approaches have been outlined along methods to integrate them in multi-scale models. 

To analyse multi-scale models sophisticated tools are necessary, whose development poses interesting 
mathematical challenges. Existing theory has to be extended (Challenges 1.1) and different mathe¬ 
matical disciplines have to be linked (Challenges 1.2 & 1.3). To facilitate this process and to bundle 
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research activities, standard classes of multi-scale models should be dehned. 

Beyond modelling and model analysis, we should also follow model-based multi-scale integration of ex¬ 
perimental data as a field of active research. The development of parameter optimisation, uncertainty 
analysis and model selection for multi-scale models is in its early phase. Computational complexity 
(Challenge 2.1), stochasticity (Challenge 2.2), reproducibility (Challenge 2.3) and the lack of computa¬ 
tional tools (Challenge 2.4) provide limitations to what is currently feasible. There are however many 
interesting ideas and approaches waiting to be explored. In particular, the use of approximation, 
reduced order models and surrogate models is promising. The derivation of appropriate surrogate 
models might even be automated as no detailed analysis of the governing equations is required but 
merely input-output data are needed. However, to use surrogate-based approaches with stochastic 
models additional robustness improvements are required. 

In addition to the development of appropriate inference methods, it has to be decided which complexity 
is truly necessary. Sometimes simplistic models might be more appropriate than detailed models, 
although the later are more realistic. If the detailed models cannot be parameterised using available 
data and inference methods, it is unclear what we can truly learn using them. We share the opinions 
of two well known scientists: “Simplicity is the ultimate sophistication” (Leonardo da Vinci), and 
“everything should be made as simple as possible, but no simpler” (Albert Einstein). Accordingly, we 
might not search for the most appropriate and detailed model, but for the model with the highest 
reliable informativeness [245]. Extensions to such concepts to multi-scale dynamical models would be 
very interesting. 

In summary, in the last decade multi-scale modelling already contributed significantly to improving 
our understanding of complex biological systems. Flagship projects, like the Virtual Heart and the 
Virtual Liver, illustrated how multi-scale modelling can be exploited. Nowadays, there is a broad 
spectrum of simulation environments for multi-scale modelling, however, methods to parameterise the 
models are mostly missing. Mathematical and computational research has to be fostered to solve 
this problem and initiatives, e.g., DREAM challenges, are needed to carefully evaluate the developed 
methods. The availability of inference tools might turn multi-scale modelling into a standard tool in 
biological sciences, similar to powerful new measurement devices. Paraphrasing the idea of Galileo 
Galilei: We should “measure what can be measured and make the rest measurable” using multi-scale 
models. 
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